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In (+) RNA coronaviruses, replication and transcription 
of the giant ~30kb genome to produce genome- and 
subgenome-size RNAs of both polarities are mediated by 
a cognate membrane-bound enzymatic complex. Its RNA- 
dependent RNA polymerase (RdRp) activity appears to 
be supplied by non-structural protein 12 (nsp12) that 
includes an RdRp domain conserved in all RNA viruses. 
Using SARS coronavirus, we now show that coronaviruses 
uniquely encode a second RdRp residing in nsp8. This 
protein strongly prefers the internal 5’-(G/U)CC-3’ tri- 
nucleotides on RNA templates to initiate the synthesis of 
complementary oligonucleotides of <6 residues in a reac- 
tion whose fidelity is relatively low. Distant structural 
homology between the C-terminal domain of nsp8 and 
the catalytic palm subdomain of RdRps of RNA viruses 
suggests a common origin of the two coronavirus RdRps, 
which however may have evolved different sets of catalytic 
residues. A parallel between the nsp8 RdRp and cellular 
DNA-dependent RNA primases is drawn to propose that 
the nsp8 RdRp produces primers utilized by the primer- 
dependent nsp12 RdRp. 
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Introduction 


Viruses of the plus-strand (+) RNA Coronaviridae family 
employ the largest multicistronic RNA genome composed 
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of a single segment of 27-32kb. This family includes the 
Coronavirus and Torovirus genera that together with the 
distantly related Arteriviridae and Roniviridae families form 
the Nidovirales order (Gonzalez et al, 2003; Draker et al, 2006 
and reviewed in Lai and Holmes, 2001; Siddell et al, 2005). 
The coronavirus genome RNA is translated to express the two 
most 5’ open reading frames (ORFs), ORFla and ORF1b, 
occupying approximately two-thirds of the genome. They 
encode major subunits of a poorly characterized mem- 
brane-bound complex that mediates the synthesis of the 
genome RNA (replication) and numerous subgenomic 
RNAs (transcription) (Gorbalenya, 2001; Thiel et al, 2001; 
Gosert et al, 2002; Prentice et al, 2004; Ziebuhr, 2005). ORFla 
encodes the replicative polyprotein (pp) 1a, and ORFs 1a and 
1b together encode pplab. Expression of the ORF1b requires 
a -1 ribosomal frameshift just upstream of the ORFla stop 
codon. The two polyproteins ppla and pplab are autocata- 
lytically processed by at least two proteases to produce up to 
16 end products called non-structural proteins (nsp) 1 to 16, 
as well as multiple processing intermediates (Thiel et al, 
2003). The central and C-proximal regions of the polyproteins 
are processed at 11 conserved sites by the chymotrypsin-like 
proteinase known also as main proteinase (Mpro) that resides 
in nsp5 (reviewed in Ziebuhr et al, 2000). 

The unique features of coronavirus RNA synthesis may be 
linked to the viral proteome bearing multiple enzymatic 
activities absent in other RNA viruses (Snijder et al, 2003). 
Replicase polyproteins of nidoviruses share a conserved array 
of functional domains (from the N- to C-termini): Mpro, RNA- 
dependent RNA polymerase (RdRp) (Cheng et al, 2005), zinc- 
binding domain-containing helicase (ZBD-HEL) (Ivanov et al, 
2004b) and uridylate-specific endoribonuclease (NendoU) 
(Bhardwaj et al, 2004; Ivanov et al, 2004a). In coronaviruses, 
RdRp, ZBD-HEL and NendoU reside in nsp12, nspl3 and 
nsp15, respectively. In large nidoviruses, such as corona-, 
toro- and roniviruses, the conserved replicase polyprotein 
backbone is elaborated with additional enzymatic domains, 
including an exoribonuclease (ExoN) (Snijder et al, 2003; 
Minskaia et al, 2006) and a putative ribose-2’-O-methyltrans- 
ferase (Feder et al, 2003; Snijder et al, 2003), which in 
coronaviruses, reside in nspl4 and nspl6, respectively. 
Despite recent progress in the characterization of replicase 
proteins, a large fraction of these proteins encoded in ORFla 
(nsp1, nsp2, some nsp3 domains, nsp4, nsp6, nsp7, nsp8 and 
nsp10) remains functionally poorly characterized. 

The crystal structure of a complex between the 197 a.a. 
nsp8 and the 83 a.a. nsp7 protein was recently determined for 
SARS coronavirus (SARS-CoV) (Zhai et al, 2005). It revealed a 
hexadecamer in which eight nsp7 molecules act as a mortar 
that holds the nsp8 octamer. RNA binding experiments with 
these subunits and the overall architecture of the complex 
suggest that the later encircles RNA. These authors have 
proposed that this complex provides a platform that might 
confer processivity to the synthesis of large RNAs in corona- 
virus nsp12 RdRp-mediated replication and transcription. 
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This RdRp is phylogenetically clustered with RdRps of (+) 
RNA viruses whose genome RNAs have the 5’-end covalently 
linked to a viral protein genome-linked (VPg) (Koonin, 1991). 
For the well-studied poliovirus RdRp, covalent (oligo)nucleo- 
tide-VPg complex serves as a primer to direct the template- 
dependent synthesis of RNAs by RdRp (Paul et al, 1998). 
RdRps of the VPg-containing viruses share a conserved 
sequence motif G that was implicated in the recognition of 
primer-template RNA complex (Barrette-Ng et al, 2002; 
Gorbalenya et al, 2002; Thompson and Peersen, 2004). The 
motif G was also identified in RdRps of coronaviruses 
(Gorbalenya et al, 2002), and accordingly, the SARS-CoV 
RdRp was shown to be primer-dependent (Cheng et al, 
2005). Since the 5’-end of genome RNA of coronaviruses is 
blocked by a cap rather than a VPg (Siddell et al, 2005), these 
data indicate that coronaviruses may have evolved a special 
mechanism to generate primers used in RNA synthesis. 

Viruses and cellular organisms have evolved a number of 
mechanisms to produce RNA primers, which are either 
derived from pre-existing RNAs (Plotch et al, 1981; Mak 
and Kleiman, 1997) or synthesized on a template. In cellular 
organisms, template-derived RNA primers direct the DNA 
synthesis at both leading and lagging strands. A specialized, 
low-fidelity DNA-dependent RNA polymerase, known as 
primase, catalyzes the synthesis of short (4-15 nucleotides) 
RNA molecules to be used as primers by replicative DNA 
polymerases; these primers are subsequently removed by 
ribonucleases (Frick and Richardson, 2001). Like other and 
structurally unrelated template-dependent polynucleotide 
polymerases, primases catalyze the formation of internucleo- 
tidic phosphodiester bonds using a common phosphoryl 
transfer reaction. 

In this report, we describe a new template-dependent 
oligonucleotide-synthesizing activity possessed by nsp8 of 
SARS-CoV. This protein recognizes a specific short sequence 
ubiquitous in the ssRNA coronavirus genome to catalyze the 
synthesis of complementary <6 nucleotides with a relatively 
low fidelity similar to that of known DNA-dependent RNA 
primases. 


Results 


The characterization of the SARS-CoV nsp8 (hereafter nsp8) 
protein described in this paper was initiated to test a possible 
modulating role of this protein for the poorly active nsp12 
RdRp. Surprisingly, nsp8 alone was found to be able to 
catalyze template-dependent oligoribonucleotide synthesis. 
This observation prompted a detailed analysis of nsp8 prop- 
erties related to this activity. The obtained results are 
described below. 


Coronaviridae nsp8 forms an isolated protein family 

Nsp8 protein is well-conserved among viruses of the 
Coronavirus genus with 17 and 66% positions being occu- 
pied by identical and similar, respectively, residues. Further 
database searches using single- and multiple-sequence 
queries and assisted by BLAST (Altschul et al, 1997) and 
HMMER (Bateman et al, 1999) programs identified no statis- 
tically significant similarity of nsp8 with other proteins. 
However in the profile searches, we observed an under-the- 
threshold hit with an equivalently located region in the 
ORFla-encoded polyprotein of the Breda bovine torovirus 
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(BToV-1) (Draker et al, 2006). In the torovirus ppla/pplab 
polyproteins, this region is flanked by two potential 3CL 
cleavage sites, which may be recognized by Mpro (Figure 1 
and AE Gorbalenya, personal communication). These data 
indicate that the identified region may be a very distant 
ortholog of nsp8. An alignment of coronavirus nsp8 and the 
putative BToV-1 ortholog identified eight absolutely conserved 
residues, which include Lys-58, Lys-82 and Ser-85 (Figure 1). 


The SARS-CoV nsp8 has RdRp activity 

Purified nsp8 (Supplementary Figure 1A) was assayed for 
RdRp activity using homopolymeric RNA _ templates 
(poly(rA), poly(rU) and poly(rC)) in a filter binding assay 
(Dutartre et al, 2005). A weak polymerization with poly(rU) 
template and no activity using poly(rA) template were ob- 
served (data not shown). A strong activity using poly(rC) and 
oligo(rC,;) template (Figure 2) was detected, with a linear 
polymerase activity after up to 4h of incubation, in a tem- 
plate-dependent manner. The activity was found to be man- 
ganese-dependent, although magnesium but not Zn**, Co** 
nor Ca** could also promote RdRp activity with a much 
lower efficiency (Supplementary Figure 1B). Nsp8 prepara- 
tions are insensitive to rifampicin, a potent Escherichia coli 
RNA polymerase inhibitor. Nsp8 polymerase activity is free of 
contaminating or intrinsic terminal transferase (TNTase) 
activity. Indeed, a TNTase activity assay carried out with 
5’-labeled oligo(rC,;) and unlabeled guanine triphosphates as 
substrates shows no ladder-like products above the labeled 
oligo(rC,;) template (data not shown). The RdRp activity was 
blocked by mutations at the most conserved positions of nsp8& 
(see below). Thus, the RdRp activity is catalyzed by nsp8& 
rather than a co-purified protein. 

Primer dependence of the nsp8 RdRp was investigated 
using poly(rC) or oligo(rC,s) templates, which were annealed 
to labeled oligo(rG) primers of one of three sizes (G2, G6 or 
G15). Elongation of the labeled primers by nsp8 was not 
detected (not shown). An ability of the nsp8 RdRp to synthe- 
size RNA in the primer-independent mode was compared 
to well-established primer-independent RdRps, the HCV 
NS5SB and the Dengue virus (DV) NSS (Ackermann and 
Padmanabhan, 2001; Kao et al, 2001). In this assay, an 
oligonucleotide of 15 consecutive cytidines (oligo(rC,;5) and 
GTP) were used as the template and the sole nucleotide 
substrate, respectively, to direct RNA synthesis. Figure 2 
shows the synthesis of oligo(rG) products for each polymer- 
ase. The size pattern of products synthesized by nsp8 is 
similar to those that were produced using HCV NSSB and, 
to a lesser extent, DV NSS. In the reactions catalyzed by either 
nsp8 or HCV NSSB, a similar pattern of products up to 7-mers 
was found. The most prominent was the accumulation of 
pppGpG, which is a hallmark of a kinetic limitation in the 
initiation step of RNA synthesis. However and in contrast to 
NSSB and NSS, nsp8 was unable to catalyze the synthesis of 
the full-length complementary copies of the template. The 
seemingly full-length products are shorter and accumulated 
to a lesser extent than those produced by HCV NSSB or DV 
NSS. These results indicate that nsp8 is an RNA-directed, 
primer-independent RdRp acting in a distributive fashion. 


Nsp8 is a sequence-specific RdRp 
The nsp8 RdRp activity is dependent on the nucleobase of 
a homopolymeric template (poly(rA), poly(rU) or poly(rC)). 
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Figure 1 Sequence alignment of nsp8 proteins. The alignment of coronavirus nsp8 sequences was generated with the ClustalW program, 
version 1.82 (http://www.ebi.ac.uk/clustalw/). This alignment and individual nsp8 sequences were used to search sequence databases as 
described in Snijder et al (2003). Using results of these searches, the original alignment was extended to a distantly related (see text) torovirus 
sequence using the MUSCLE program. The resulting alignment was converted into this figure using the ESPript program, version 2.2 (http:// 
espript.ibcp.fr/ESPript/cgi-bin/ESPript.cgi). Residues that are conserved in all or >70% sequences are boxed in red and yellow, respectively. 
Above the alignment, numbering and secondary structure elements (Zhai et al, 2005) for SARS-CoV nsp8 protein are depicted. National Center 
for Biotechnology Information (NCBI) Accession Numbers for replicase polyprotein sequences including nsp8: SARS-CoV, NP_828866; Breda 
torovirus (BToV-1), AY427798; Transmissible Gastroenteritis coronavirus (TGEV), NP_840006; Porcine epidemic diarrhea virus CV777 (PEDV), 
NP_839962; Human coronavirus NL63 (NL63), AAS58176; Human coronavirus 229E (229E), NP_835349; Human coronavirus HKU1 genotype 
A (HKU1), AAT98578; Human coronavirus OC43 (OC43), NP_937947; Bovine coronavirus (BCoV), NP_742135; Mouse Hepatitis virus strain 
AS9 (MHV-A59), NP_740613; Avian Infectious Bronchitis Virus (AIBV), NP_740626. 


A significant nsp8 RdRp activity was only detected using 
poly(rC) template. Since poly(rG) template generates strong 
secondary structures, it cannot be tested in a similar poly- 
merase assay. To analyze whether guanines could be part of a 
template for nsp8, the heteropolymeric RNA template, 5’-UAU 
AAU GGA AAA-3’ (oligo 5), containing two internal guanines, 
was tested. No nsp8 RdRp activity was detected using this 
template. Based on these and other experiments (see below), 
we concluded that the template must contain at least one 
cytosine to promote RNA synthesis by nsp8. A 373-nt hetero- 
polymeric RNA template identical to a part of the SARS-CoV 
genome was used to monitor RdRp activity, and the results 
are presented in Figure 3A. Nsp8 is able to synthesize only 
short RNA products (<6-mers), whereas HCV NSSB RdRp is 
able to synthesize both short and long products. The nsp8& 
heterogeneous product profile suggests various internal in- 
itiation sites (see below). Various short synthetic RNA oligo- 
nucleotides were then designed to refine relevant template 
requirements for an efficient synthesis of RNA products by 
nsp8 (Table I). 
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Two sets of oligonucleotides pairs were designed, each 
containing two adjacent cytidines either internally or at the 
3'-end of a template. Cytidines (underlined below) were 
placed downstream of either U (lst set—oligo 1: 5’- 
AAAAAAAAAGUAUCC-3’ and oligo 2: 5’-UAUAAUCCGAAA- 
3’) or G; 2nd set—oligo 3: 5’-AAAAAAAAAAUUGCC-3’ and 
oligo 4: 5’-UAUAAGCCAAAA-3’). With both sets, product 
synthesis was only observed using templates bearing internal 
CC (oligo 2 and oligo 4). Thus, nsp8 is unable to start the 
synthesis of a complementary sequence at the 3’-end of a 
linear RNA template and it requires a template cytidine to be 
flanked by at least two nucleotides from the 3’-end. 

The specificity at the + 2 and —1 positions relative to the 
template 3’-cytidine was then addressed. All the templates 
tested in this study and the results regarding the polymerase 
assay are shown in Table I. Sequences that include 
the cytidine-containing trinucleotides (5’-3') -CCU-, -ACG-, 
-ACU-, -UCG-, -ACA-, -UCA- or -ACC- proved to be poor 
templates for nsp8 (Table I). Rational design of test sequences 
showed that the minimal sequences requirements are UCC, 
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Figure 2 Kinetics of RNA synthesis using nsp8 protein, NS5SB 
polymerase of HCV and NSS polymerase of Dengue virus and an 
oligo(rCjs;) RNA template. 1uM of nsp8, 14M HCV NSSB and 
400nM of Dengue NSS proteins were mixed with [a-**P]GTP 
(10 1M) and oligo(rC,;) template (101M) to start the reaction (as 
described under ‘Materials and methods’). At various times (0, 1, 15 
and 30min), reaction aliquots were quenched by the addition of 
EDTA/formamide, and the RNA products were resolved on 14% 
polyacrylamide/7 M urea gel. Due to the likely formation of 
poly(rG) secondary structures, product size and abundance is not 
visually accurate for >8-mer products. M, RNA marker synthesized 
using T7 RNA polymerase and an appropriate template; each band 
product is indicated on the left. 


GCC, GCA, UCU and GCU in order to promote activity, 
with the optimal sequence being 5/-(G/U)CCNN-3’ 
(Figure 3B, and not shown). Synthesis starts on the 5’ C 
leaving one cryptic 3’ C. Figure 3B (left lanes) shows nsp8- 
dependent band products formation, using oligo 10 template 
(5'-UAUAGUCCCAAA-3’). Products accumulate over time, for 
each incorporation, confirming that nsp8 acts in a rather 
distributive fashion. Taken together, these results demon- 
strate that nsp8 is a sequence-specific RdRp. 

As GTP is the required initiating (+ 1) nucleotide, we used 
the oligo 10 template (Table I) to measure K,,(GTP) at 
position +1. An overall incorporation efficiency Vmax/Km 
of 0.65x10-°min~'pM~' was measured for GTP, with 
Km(GTP) of 126+14yM, a value comparable to that of 
other viral RdRps (Ranjith-Kumar et al, 2002; Castro et al, 
2005; Selisko et al, 2006). This latter value is significantly 
lower than the intracellular GTP concentration (1-4mM) 
(Hauschka, 1973), suggesting that the cellular GTP con- 
centration is not rate-limiting for the nsp8-mediated 
RNA synthesis initiation reaction. The ATP and CTP 
incorporation efficiencies at the +2 position were on the 
same order of magnitude, 1.7 x 10-° min-'wM7! and 0.65 x 
10-° min~! uM, respectively. 

We examined the nsp8 nucleotide insertion fidelity under 
conditions where the reaction mix assay specifically lacked 
one of the required NTP. Oligo 10 (Table I) was used with four 
different sets of nucleotides (Figure 3B). In the presence of 
the four NTPs, the synthesized product should correspond to 
5'-GACUAUA-3’ (Figure 3B, left lanes). In contrast, when ATP, 
UTP or CTP are omitted, a full size product is not detected, 
indicating that in the absence of the required NTP, nsp8 does 
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Figure 3 Nsp8 polymerase template requirements. (A) Time course 
of nucleotide incorporation using a purified 373 nt RNA template 
and 1 1M nsp8 (left panel) or 1 1M HCV NSSB (right panel) together 
with 500uM ATP, UTP, CTP, and 101M [o°**P]GTP. Reaction pro- 
ducts are indicated on the side of the gel autoradiograph. Positions 
of the RNA markers (M) synthesized using T7 RNA polymerase are 
shown on the left. (B) Time course of nucleotide incorporation 
using RNA template 10 (5’-UAUAGUCCCAAA-3’) together with 
1M nsp8 and 10uM [a«°*P]GTP. Reactions were performed with 
all required nucleotides or lacking either ATP, UTP or CTP, as 
indicated. 


not significantly misincorporate a nucleotide (Figure 3B). 
Based on the observed rate of misincorporation, we could 
estimate a lower limit of 1 misincorporation per 10 nucleo- 
tides synthesized for nsp8 RdRp (data not shown, also see 
Figure 3 and Discussion). When ATP is omitted, a shift is 
observed from the usual dinucleotide product pppGA towards 
pppGG, indicating that nsp8 rather favors a shift in its 
initiating site over a misincorporation at the original site 
(Figure 3B). 
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Selective nsp8 inhibition using GTP analogs 

The nsp8 protein may represent an attractive target against 
coronaviruses, several nucleotides analogs (3’-dGTP, ddGTP 
and 2/-O-methyl-GTP) were tested for their capability to 


Table I Synthetic nucleic acid templates used in this study 


Oligo name Sequence Template usage 
efficiency (%) 
Oligo 1 5’‘-AAAAAAAAAGUAUCC-3’ 0 
Oligo 2 5'-UAUAAUECGAAA-3’ 35 
Oligo 3 5'‘-AAAAAAAAAAUUGCC-3’ 0 
Oligo 4 5'-UAUAAGECAAAA-3’ 30 
Oligo 5 5'-UAUAAUGGAAAA-3’ 0 
Oligo 6 5’-AAAACCUAAUAU-3’ 0 
Oligo 7 5'-UAUAAACGAACUUUAA-3’ 0 
Oligo 8 5'-AAAUCAAAA-3’ 0 
Oligo 9 5'-AUAUCGUUUACAAAA-3’ 0 
Oligo 10 5'-UAUAGUECCAAA-3’ 100 
Oligo 11 5'-AAGECCAAA-3’ 95 
Oligo 12 5'-UAUAAUGCAAAA-3’ 100 
Oligo 13 5'-UAUAAUGGAAAA-3’ 83 
Oligo 14 5'-UAUAAUEGCCAAA-3’ 80 
Oligo 15 5'-UAUAAUGCUAAA-3’ 56 
Oligo 16 5'-UAUAAUUGUAAA-3’ 69 
Oligo 17 5'-UAUUAAGGUAAA-3’ 53 
All oligonucleotides are RNA except oligonucleotide 11, which is a 


single-stranded DNA template. RNA synthesis initiation sites are 
shadowed in gray. From this table and other experiments (not 
shown), the best priming site was found to be 5’-G/UCCNN-3’ 
(see text). 

Oligo 10 is taken as a reference, and approximate relative template 
usage efficiencies are indicated on the right. 
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inhibit nucleotide incorporation into RNA using a poly(rC) 
template. 3’-dGTP was found to efficiently inhibit nsp8 RdRp 
activity (Figure 4), whereas ddGTP and 2’-O-methyl-GTP 
were weak inhibitors (data not shown). Using increasing 
3'-dGTP concentrations, most of the band products vanish, 
with no significant appearance of chain terminated products 
(Figure 4A). Comparatively, chain terminated products 
increasing over time are apparent when using HCV NSSB 
in a similar experimental setting (Figure 4B, asterisks) The 
fact that ladder-like product formation does not occur sig- 
nificantly using nsp8 suggests that most inhibition occurs at 
position +1 of the template. 


Conserved nsp8 residues are critical for RdRp activity 

In order to identify essential residues for nsp8 activity, 
alanine-scanning mutagenesis was applied to all conserved 
charged and selected polar amino-acid residues. In total, 14 
mutants of nsp8 were generated (Figure 5), expressed in 
E. coli and purified to apparent homogeneity using affinity 
and exclusion chromatographies. The RdRp specific activity 
of these mutants was measured using [* H]GTP incorpora- 
tion into RNA using a poly(rC) template, and was expressed 
as a percentage of the activity of the wild-type nsp8 (Figure 5 
and Supplementary Figure 3). The K58A, R75A, K82A and 
S85A mutations abolished or greatly reduced the RdRp 
activity. The nsp8 RdRp activity was most sensitive to 
replacement of either of the three residues, K58, K82 and 
S85, which are also conserved in the torovirus sequence. The 
D161A mutant remains substantially active. Although the 
nsp8 activity is metal-ion dependent, there are no essential 
and conserved acidic residues that may chelate a catalytic 


— — ‘ 
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Figure 4 Inhibition of RNA synthesis by 3’-dGTP using a poly(rC) template. (A) Effect of increasing concentrations of 3’-dGTP on the initiation 
and full-length RNA product formation. The nucleoside analog was added to reaction mixtures containing 1 1M poly(rC), 10M [a**-P] GTP 
and 1 uM nsp8. Lanes 1-5 show the products of the reactions performed using the following concentrations of 3’-dGTP: 0, 100, 500 nM, 1 and 
10 1M. Reactions were allowed to proceed for 60min. M, RNA marker synthesized using T7 RNA polymerase and an appropriate template. 
(B) Same as in (A) using 10 uM oligo(rC,;) as a template in conjunction with 1 14M HCV NSSB. Each chain terminated band product is indicated 
on the right by an asterisk (*). For each 3’-dGTP concentration (lanes 1-5: 0, 5, 10, 50 and 100 uM, respectively), the reaction was allowed to 


proceed for 30, 60 and 120 min. 
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Figure 5 Alanine scanning mutagenesis of the nsp8 protein. Indicated amino acids were replaced by alanine using site-directed mutagenesis. 
One micromolar of nsp8 wt and 1 1M of Ala mutants were tested for polymerase activity by measuring [*H]GTP incorporation using a poly(rC) 
template. Nsp8 polymerase specific activity is represented as follows: empty lozenge (©); 100% activity relative to nsp8 wt; black lozenge 
(@), between 12 and 40% activity and; asterisk (*), less than 4% activity (see Supplementary Figure 3 for precise values). 


Mn’* (see Discussion). Activities recorded with the nsp8 
mutants are thus fully consistent with the alignment pre- 
sented in Figure 1. 


A structure- and activity-based model of nsp&8 RdRp 
The crystal structure of nsp8 in complex with nsp7 has been 
determined recently (Zhai et al, 2005). Nsp8 and nsp7 form a 
hexadecameric toron structure able to encircle and bind RNA 
as judged by both the presence of a strong positive charge in 
the inner channel and biochemical assays. Nsp7 is thought 
to play the role of a mortar, bringing cohesive force to the 
complex with no obvious role in RNA binding (Zhai et al, 
2005). All conserved nsp8 residues, which are essential for 
the RdRp activity, are located on the second large alpha helix 
(Figure 1) with residues Lys-58 and Arg-75 surrounding, but 
slightly outside the channel. These residues were implicated 
in RNA binding. The essential residues map to a dimeric 
nsp8, which is part of two equivalent dimers in the hexa- 
decameric nsp8/nsp7 complex. 

Interestingly, the comparison of the folding of the head 
domain of nsp8 (corresponding to the most C-terminal 99 aa 
residues) shows similarity to a family of the RNA-binding 
domains (RBD) (Supplementary Figure 4) (Krissinel and 
Henrick, 2004). This family is characterized by the two 
ssRNA recognition motifs Rnpl and Rnp2 (Maris et al, 
2005). These motifs are composed of mainly hydrophobic 
and positively charged residues. Both the RBD and the head 
of nsp8 are folded into an «/f sandwich. The structure of the 
heterogeneous nuclear ribonucleoprotein D (hnRNP D), be- 
longing to the RBD family, was solved in complex with 
ssRNA by NMR (Enokizono et al, 2005). Identical connectiv- 
ity and sequential arrangement of secondary structural ele- 
ments are also conserved in the RdRp palm subdomain 
(containing catalytic Asp residues in the GDD motif) 
(Hansen et al, 1997) (Supplementary Figure 5), providing 
evolutionary hints about a possible nsp8 origin (see 
Discussion). 

Altogether, we can thus propose a model for a quaternary 
initiation complex involving two monomers of the nsp8 
protein, an ssRNA template (5’-UAGC-3’) and the first two 
complementary nucleotides incorporated (GTP and CTP). We 
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have superimposed the RNP motifs of nsp8 head domain onto 
the RNP motifs of hnRNP D. In this superimposition, the 
ssRNA is stacked by the hydrophobic surface of the two RNP 
motifs and points towards the inner part of the hexadecamer 
channel (Figure 6B). Thus, while an nsp8 molecule of the 
hexadecamer stacks the RNA template (Figure 6A, repre- 
sented in black), a second is able to bind the nascent primer. 
Then, the two first NTPs incorporated (GTP in +1 and CTP 
in +2) are bound to two strictly conserved basic residues 
Arg-75 and Lys-82 present on the second nsp8 molecule 
(Figure 6A, represented in clear gray). The positioning of 
the GTP primer against the helix implies that triphosphated 
dimers and longer primers would not fit into the active site, 
consistent with the fact that they are not elongated. 


Discussion 


Many genetic and mechanistic features distinguish the cor- 
onavirus replication machinery among those encoded by 
other RNA viruses. We have now discovered a second RdRp 
in SARS-CoV, the first of this kind in RNA viruses, thus 
providing further evidence for the unprecedented sophistica- 
tion of the replication complex in coronaviruses. In the 
context of other data accumulated in the field, the described 
nsp8 RdRp properties indicate that this enzyme may catalyze 
the synthesis of RNA primers for the primer-dependent nsp12 
RdRp. Although primers are used in genome replication by 
numerous RNA viruses, coronaviruses may be unique in 
evolving a specialized RNA enzyme for primer synthesis 
(primase). 

It was previously shown that the RdRp domain of corona- 
viruses is evolutionary clustered with RdRps of (+) RNA 
viruses that may use a protein (VPg-like) for priming RNA 
synthesis (Gorbalenya et al, 1989; Koonin, 1991). These 
RdRps are distinguished by the presence of the G sequence 
motif that was implicated in the primer/template recognition 
(Gorbalenya et al, 2002). Relevance of these observations 
for coronaviruses may be linked to the finding that the 
SARS-CoV nsp12 RdRp is active, in vitro, using a poly(rA)/ 
oligo(dT), 13 primer-template (Cheng et al, 2005). Since 
coronaviruses may not produce VPg, the identity of a primer 
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Figure 6 Model of two monomers of the nsp8 protein in complex 
with an ssRNA template (5’-UAGC-3’) and two nucleotides (GTP 
and CTP). RNA template, GTP and CTP are shown by a stick model. 
(A) The amino-acid residues related to null mutants, Lys-58 and 
Arg-75, are represented in yellow. The two first NTPs incorporated 
(GTP in +1 and CTP in +2) are indicated. Discontinuous purple 
line represents distance between GTP 3’-OH (incorporated nucleo- 
tide in position + 1) and «-phosphate of CTP (incorporated nucleo- 
tide in +2 position), estimated to 3.8 A. (B) The surface is colored 
according to the electrostatic potential nomenclature (blue, 
positive charge; red, negative charge). Images were generated 
using PYMOL. 


and the mechanism of its generation have remained unre- 
solved for these viruses. Besides VPg, cellular RNA molecules 
are recruited by some RNA viruses to prime polynucleotide 
synthesis in either replication or transcription of the genome. 
This strategy was adopted by influenza viruses and retro- 
viruses, which either hijack a piece of a cellular mRNA 
(Plotch et al, 1981) or use a tRNA (Mak and Kleiman, 
1997), respectively, as a primer. We propose that corona- 
viruses have evolved another strategy to produce primers, 
which may use the newly identified nsp8 RdRp as the 
primase. 

In the DNA world, primases are ubiquitous and it may be 
instructive to compare the nsp8 RdRp with these enzymes. 
The general function of a DNA primase is to synthesize an 
RNA primer on a DNA template. In most viruses and cellular 
organisms that replicate their genomes through semidiscon- 
tinuous DNA synthesis, DNA-dependent DNA polymerase 
(DdDp) recognizes the primer/template complex to extend 
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the primer-initiated synthesis of the complementary DNA 
strand (Frick and Richardson, 2001). In comparison to other 
template-dependent DNA and RNA polymerases, fidelity is of 
a less importance for DNA-dependent primases. Subsequent 
to the primer utilization in DNA synthesis, the RNA primer is 
removed and a gap in the nascent DNA is sealed with newly 
generated DNA by several enzymes including a high-fidelity 
DdDp. 

In coronaviruses, the replication complex is yet to be 
characterized in sufficient details in vitro and in vivo 
(Ziebuhr, 2005). Recent analysis indicates that it is likely to 
include virus-encoded RNA processing enzymes ExoN 
(nsp14), EndoU (nsp15), and a putative ribose-2’-O-methy]l- 
transferase (nsp16) (Snijder et al, 2003; Bhardwaj et al, 2004; 
Ivanov et al, 2004a; Ziebuhr, 2005; Minskaia et al, 2006), 
which could excise primers synthesized by the low-fidelity 
nsp8 RdRp (with a misincorporation rate of 1/10 as a lower 
limit). This excision could be part of the methyl directed- 
mismatch repair activity that is worth further testing. 
Similarly to the E. coli and eukaryotic primases, the nsp8 
RdRp exhibits a limited processivity. The prokaryotic DNA 
primases (i.e., from the E. coli DnaG family) generally 
recognize specific sequences on the template (Yoda et al, 
1988), while eukaryotic primases are sequence-independent 
(Bullock et al, 1994). The SARS-CoV nsp8 revealed marked 
sequence preferences and, like cellular primases, RNA synth- 
esis starts with a purine residue. Once the nsp8/RNA/nucleo- 
tides ternary complex has been formed, a rate-limiting step 
occurs before or during dinucleotide synthesis, a feature that 
is also common for cellular primases. In our hands, nsp8 and 
the purified nsp7-nsp8 complex exhibit comparable activities 
(not shown). The only noticeable biochemical difference 
between nsp8 and the nsp8-nsp7 complex, which may be 
the functional form of the nsp8 RdRp, is a relatively poor 
thermal stability of nsp8 (data not shown). Remarkably, this 
property was predicted upon examination of the crystal 
structure of the nsp8-nsp7 complex by Zhai et al (2005), 
who described nsp7 as a ‘mortar’ protein. 

Little is known about the initiation of RNA synthesis in 
coronaviruses although terminal sequences were implicated 
in the control of the process (Lai and Cavanagh, 1997). 
Depending on the polarity, plus or minus, and the size, 
genome or subgenome, single-stranded RNAs, partial sin- 
gle-stranded RNAs (known as replicative intermediates) and 
double-stranded RNAs (replicative forms) appear to serve as 
templates. The apparent complexity of the RNA synthesis 
may accommodate the postulated primase activity in differ- 
ent ways and the dissection of this aspect requires further 
analysis. 

We note here that the sequence specificity of the nsp8 
RdRp is not stringent and, potentially, this enzyme could 
initiate RNA synthesis at numerous internal places at the 
genome or its complement that would also be reminiscent of 
the modus operandi of cellular primases. In this way, the 
giant genome of coronaviruses could replicate much faster 
and, possibly, more accurately than it would otherwise using 
a single 5’-terminal primer. These properties could form 
a basis that has driven the origin of the primase in the 
coronavirus evolution. 

The identification of nsp8 as a potential primase should 
facilitate developing functional assays for studying the repli- 
case machinery in vitro. Our preliminary results show that in 
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agreement with the reported results and the proposed model, 
purified nsp8 and nsp12 interact in GST-pull down experi- 
ments (Imbert et al, unpublished data). The 3- to 5-fold 
excess of the nsp8 synthesis relative to nsp12, due to down- 
regulation of the latter by frameshifting (Brierley, 1995; Thiel 
et al, 2003), seems to be used to build the nsp7/nsp8 octamer 
complex containing four nsp8 subunits (Zhai et al, 2005). 
Consequently, equimolar stoichometric ratio between inter- 
acting nsp8 and nspl2 species may be maintained in the 
infected cell. The nsp8 has a unique bi-domain structure that 
is different from those of prokaryotic and eukaryotic pri- 
mases. Its C-terminal domain has a fold also found both in 
a diverse family of RNA-binding proteins and the catalytic 
palm-subdomain of RdRps (Hansen et al, 1997; see 
Supplementary Figure 5). This finding indicates that the 
nsp8 RdRp and nsp12 RdRp may have originated from a 
common ancestor, possibly through a duplication during 
evolution leading to the emergence of the ancestral virus of 
the Coronaviridae family. Previously, duplications of PLpro 
and Mpro were implicated in the evolution of the coronavirus 
proteome (Ziebuhr et al, 2001). 

The eight most conserved residues are distributed between 
the two domains of nsp8. Three polar and essential residues, 
Lys-58, Lys-82 and Ser-85, which may be part of the catalytic 
residues network for the phosphoryl transfer reaction, are 
located in the N-terminal domain. As in the case of the 
coronavirus Mn?*-dependent endonuclease nsp15, the 
metal-ion dependence in catalysis remains undefined and 
is unlikely to be promoted by acidic residues. However, our 
tertiary structure modeling analysis further suggests that the 
highly conserved Trp-182 in the C-terminal domain (head 
domain) is close to the a-phosphate of the + 2 nucleotide and 
might be involved in Mn** coordination promoting metal- 
based nucleophilic activation at the phosphorus center. 
Indeed, such interaction between aromatic residues and 
cation, termed cation-x (Dougherty, 1996), have been de- 
scribed as a non-covalent bonding interaction relevant for 
molecular recognition and catalysis (Zaric et al, 2000). A role 
for Trp-182 in catalysis is consistent with the fact that in the 
putative torovirus ortholog, the residue is also an aromatic 
residue (Tyr). Regardless of the precise composition of the 
catalytic center of nsp8, it differs from that conserved in the 
nsp12 RdRp, impeding to determine undoubtedly if these two 
RdRps have been acquired independently or have diverged 
profoundly. 


Materials and methods 


Materials and reagents 

RNA oligonucleotides were obtained from Dharmacon. DNA 
oligonucleotides were obtained from Invitrogen. A 373-nt template 
corresponding to nt 13905-14278 of the SARS-CoV genome (strain 
Frankfurt, GenBank Accession No. AY291315) was produced using 
an in vitro T7 transcription kit, and purified as described by the 
manufacturer (Ambion Inc.). Homopolymeric cytosine template 
(poly(rC)), 15-mers cytosine RNA oligonucleotide (oligo (rC;s)), 
a-**P-labeled guanosine 5/-triphosphate (3000Ci/mmol), «-**P- 
labeled cytosine 5’-triphosphate (3000 Ci/mmol), uniformly labeled 
(HIGTP (5.20Ci/mmol) and nucleosides 5’ triphosphate were 
purchased from Amersham Biosciences. Nucleosides analogs 3’- 
deoxy GTP, 2'-O-methyl-GTP and di-deoxy-GTP were purchased 
from Trilink, Inc. RNA molecular weight markers were synthesized 
as described in Dutartre et al (2005). HCV NS5SB and NSS Dengue 
polymerases were purified as in (Selisko et al, 2006). 
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SARS nsp8 plasmid constructions, E. coli protein expression 
and purification 

The SARS-CoV nsp8 coding sequence was amplified by PCR from 
the cDNA prepared as previously described (Drosten et al, 2003). 
The cDNA was then subcloned in the pDest14 plasmid (Invitrogen) 
in a manner analogous as nsp9 described in Campanacci et al 
(2003). The ORF of the final construct (referred to as nsp8) encoded 
an N-terminally 6 His-tag. This construct was mutated using the 
QuikChange site-directed mutagenesis kit, according to the manu- 
facturer’s instruction (Stratagene). All constructions were verified 
by DNA sequencing (Millegen, France). Proteins expression and 
purification were performed as described (Campanacci et al, 2003). 
Proteins were homogenous as judged by SDS-PAGE (see Supple- 
mentary Figure 1A). They were concentrated to 5 mg/ml and stored 
in 50% glycerol at —20°C. Recombinant proteins were characterized 
by dynamic light scattering and circular dichroism spectra, which 
were undistinguishable from wt nsp8. Enzyme concentrations were 
determined using UV,239 absorbance. No attempts were made to 
determine the proportion of active enzyme (enzyme active site 
concentration). 


Nsp8-mediated steady-state incorporation of nucleotide 
using RNA templates 

Polymerase activity was assayed by monitoring the incorporation 
of radiolabeled guanosine using either oligoribonucleotide or 
polycytosine (poly(rC)) templates. All indicated concentrations 
are final. The reaction was performed in an optimized polymerase 
buffer made of 50mM Tris pH 7.5, 10mM KCl, 4mM MgCl, 1mM 
MnCl,, 10mM dithiothreitol, 1% Triton X-100 containing 10 1M of 
[a-**P]GTP. The templates were either 10 1M RNA oligonucleotide, 
1uM poly(rC). Reactions were initiated by the addition of 1uM 
purified nsp8 and incubated at 30°C. Aliquots were withdrawn over 
time from 10s to 2h and the reaction was stopped by the addition of 
EDTA/Formamide. Reaction products were separated using sequen- 
cing gel electrophoresis (14% acrylamide, 7M urea in TTE buffer 
(89 mM Tris, 28 mM taurine, 0.5 mM EDTA)) and quantitated using 
photo-stimulated plates and a Fujilmager (Fuji). In some instances, 
nsp8 activity was quantitated using a filter paper binding assay. 
Reactions were initiated by the addition of 1 1M nsp8 in polymerase 
buffer containing 1M poly(rC) template, 0.1mM [*H]GTP 
(0.5 Ci), in the same buffer as above, incubated at 30°C, and 
stopped by spotting aliquots onto DE-81 paper discs (Whatman 
International Ltd). Filter paper discs were washed three times for 
10min in 0.3M ammonium formate, pH 8.0, washed two times in 
ethanol, and dried. The radioactivity bound to the filter was 
determined using liquid scintillation counting. Under these condi- 
tions, the nsp8 specific activity was consistently in the vicinity of 
62¢c.p.m.min'. 


Steady-state kinetic studies 

To determine the K,, and Vinax for CTP incorporation by nsp8, the 
RNA_ 5/-UAUAAGCCAAAA-3’ template (10uM) was mixed in 
polymerase buffer with 1M nsp8. The reaction was started by 
the addition of 10M [o-**P]GTP and increasing concentration of 
CTP (1, 5, 10, 50, 75, 100, 300 and 500 uM). To determine the K,, 
and Vmax for the incorporation of ATP, RNA 5’-UAUAGUCCCAAA-3’ 
was incubated with 10 uM [«-**P]GTP and increasing concentration 
of ATP (1, 5, 10, 50, 75, 100, 300 and 500 1M). Finally, the K,, and 
Vmax for GTP incorporation was determined using RNA 5’- 
UAUAGUCCCAAA-3’ template incubated with 104M [a-**P]CTP 
and increasing concentration of GTP (1, 5, 10, 50, 75, 100, 300 and 
500 uM). The reactions were incubated at 30°C for 15, 30, 60 and 
120 min. Aliquots were withdrawn during the time course of the 
reaction, and the reactions were quenched with EDTA/formamide. 
Products were separated using sequencing gel electrophoresis and 
quantified using photo-stimulated plates and a Fujilmager (Fuji). 
Product formation was represented by the hyperbolic equation 
describing V; dependence on NTP concentration 


Vi= Vinax(NTP)/(Km oe (NTP)) 


where Vinax and Ky, are the maximal velocity and the affinity 
constant of NTP incorporation by nsp8, respectively. Vinax and Ky, 
were determinated from curve-fitting using KaleidaGraph (Synergy 
Software). 
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Fold comparison 

The fold comparison was performed using the SSM server (protein 
structure comparison service SSM at European Bioinformatics 
Institute, http://www.ebi.ac.uk/msd-srv/ssm) with a truncated 
form of nsp8 termed ‘head’ domain (corresponding to the 
C-terminal 99a.a.). The superimposition between the head of 
nsp8 and a member of the RBD, the C-terminal of CstF-64 (PDB 
code: 1plt) (Perez Canadillas and Varani, 2003) shows a global 
RMSD of 2.8A. However, the RMSD calculated only for the two 
motifs is about 1A for RNP2 and 1.9A for RNP1 (Supplementary 
Figure 4). We also performed a structural alignment with different 
RNP members to check the correlation between the existence of the 
RNP motifs and the position in the structure. This alignment was 
carried out using MUSCLE (Edgar, 2004). Then, the alignment was 
analyzed and optimized with SeaView (Galtier et al, 1996), taking 
into account the secondary structure from the high-resolution 
models. 


Modeling 

The crystal structure of the nsp8 hexadecamer (Zhai et al, 2005) 
was submitted to 30 cycles of rigid body and 100 cycles of conjugate 
gradient with CNS (Brunger et al, 1998). Based on the structural 
alignment (Supplementary Figure 4), we have generated a super- 
imposition between hnRNP D with its ssRNA (PDB: 1lwtb) 
(Enokizono et al, 2005) and the head of one nsp8 monomer. Then, 
the position of the ssRNA was manually adjusted using Coot 
(Emsley and Cowtan, 2004) to avoid steric clash and to correct the 
direction of the ssRNA backbone to point out towards the central 
cavity of the hexadecamer. Another minimization cycle was 
performed taking into account the presence of the RNA template 
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(as described above). To model the initiation state of nsp8, we 
docked the first two nucleotides base pair complementary (GTP in 
+1 and CTP in +2) to the ssRNA template. These two nucleotides 
are the first two to be incorporated in the nascent RNA chain. These 
two nucleotides of the new strand were manually docked on the top 
of the head respecting Watson and Crick base pairing dictated by 
the template. A last round of energy minimization was performed 
on this quaternary structure (primase/template/incorporated nu- 
cleotides). 


Supplementary data 
Supplementary data are available at The EMBO Journal Online 
(http://www.embojournal.org). 
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